Question Answering using Vector Based Information Retrieval Paradigm with Word Sense Disambiguation
نویسنده
چکیده
Vector Based Text Classification in the Question Answering field has long been explored. However, there has not been any attempt so far to take word senses into consideration in the development of the feature sets in the classifier. This paper aims to investigate the performance of a question answering text classifier built using not just the root form of words but also taking the senses of those words into thought. Having done a 10folded cross validation, the classification error rate using the tri-gram model actually shows that there is a significant improvement when the word sense of a word is actually known. A chi squared analysis performed to see the correlation between the use of word sense and the affect on the classification accuracy shows a 97.5% confidence level. This simply tells us that the usage of word sense in building the classifier has indeed a strong association with the classification accuracy.
منابع مشابه
Word Sense Disambiguation by Machine Learning Approach: A Short Survey
There is a renewed interest in word sense disambiguation (WSD) as it contributes to various applications in natural language processing. Applications for which WSD is potentially an issue are: Machine Translation, Information Retrieval (IR), QA systems, Dialogue systems,etc. In this paper we survey vector-based methods for WSD in machine learning approache.
متن کاملUofL: Word Sense Disambiguation Using Lexical Cohesion
One of the main challenges in the applications (i.e.: text summarization, question answering, information retrieval, etc.) of Natural Language Processing is to determine which of the several senses of a word is used in a given context. The problem is phrased as “Word Sense Disambiguation (WSD)” in the NLP community. This paper presents the dictionary based disambiguation technique that adopts t...
متن کاملMultiwords and Word Sense Disambiguation
This paper studies the impact of multiword expressions on Word Sense Disambiguation (WSD). Several identification strategies of the multiwords in WordNet2.0 are tested in a real Senseval-3 task: the disambiguation of WordNet glosses. Although we have focused on Word Sense Disambiguation, the same techniques could be applied in more complex tasks, such as Information Retrieval or Question Answer...
متن کاملNTT's Japanese-English Cross-Language Question Answering System
This paper describes NTT’s Japanese-English Cross-Language Question Answering System SAIQAJ/E. The system performed best among eight systems that participated in the Japanese-English subtask of the NTCIR Cross-Language Question Answering task. For cross-language document retrieval, we used a dictionary-based approach without word sense disambiguation. We used a synonym operator to represent tra...
متن کاملSense-based Blind Relevance Feedback for Question Answering
This paper addresses the problem of enhancing document retrieval under the specific restrictions posed by the Question Answering scenario. In particular, given an input question, we aim at defining a reliable method for expanding its keywords with semantic information extracted from WordNet (e. g. synonyms or hypernyms). This is a challenging task, since it is intrinsically dependent on high qu...
متن کامل